Low-dimensional representation of Gaussian mixture model supervector for language recognition

نویسندگان

Jinchao Yang

Xiang Zhang

Hongbin Suo

Li Lu

Jianping Zhang

Yonghong Yan

چکیده

In this article, we propose a new feature which could be used for the framework of SVM-based language recognition, by introducing the idea of total variability used in speaker recognition to language recognition. We consider the new feature as low-dimensional representation of Gaussian mixture model supervector. Thus we propose multiple total variability (MTV) language recognition system based on total variability (TV) language recognition system. Our experiments show that the total factor vector includes the language dependent information; what’s more, multiple total factor vector contains more language dependent information. Experimental results on 2007 National Institute of Standards and Technology (NIST) Language Recognition Evaluation (LRE) databases show that MTV outperforms TV in 30 s tasks, and both TV and MTV systems can achieve performance similar to that obtained by state-of-the-art approaches. Best performance of our acoustic language recognition systems can be further improved by combining these two new systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supervector LDA: A New Approach to Reduced-Complexity I-vector Language Recognition

In this paper, we extend our previous analysis of Gaussian Mixture Model (GMM) subspace compensation techniques using Gaussian modeling in the supervector space combined with additive channel and observation noise. We show that under the modeling assumptions of a total-variability i-vector system, full Gaussian supervector scoring can also be performed cheaply in the total subspace, and that i-...

متن کامل

مقایسه روش های طیفی برای شناسایی زبان گفتاری

Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...

متن کامل

Noise Compensation for Speech Recognition Using Subspace Gaussian Mixture Models

In this paper, we adress the problem of additive noise which degrades substantially the performances of speech recognition system. We propose a cepstral denoising based on the Subspace Gaussian Mixture Models paradigm (SGMM). The acoustic space is modeled by using a UBM-GMM. Each phoneme is modeled by a GMM derived from the UBM. The concatenation of the means of a given GMM leads to a very high...

متن کامل

Gaussian Mixture Model Weight Supervector Decomposition and Adaptation

This report proposes a novel approach for Gaussian Mixture Model (GMM) weights decomposition and adaptation. This modeling suggests a new low-dimensional utterance representation method, which uses a simple factor analysis similar to that of the i-vector framework. The suggested approach is applied to the Robust Automatic Transcription of Speech (RATS) language identification evaluation corpus,...

متن کامل

Novel Gaussianized vector representation for improved natural scene categorization

We present a novel Gaussianized vector representation for scene images by an unsupervised approach. Each image is first encoded as an ensemble of orderless bag of features. A global Gaussian Mixture Model (GMM) learned from all images is then used to randomly distribute each feature into one Gaussian component by a multinomial trial. The posteriors of the feature on all the Gaussian components ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

EURASIP J. Adv. Sig. Proc.

دوره 2012 شماره

صفحات -

تاریخ انتشار 2012

Low-dimensional representation of Gaussian mixture model supervector for language recognition

نویسندگان

چکیده

منابع مشابه

Supervector LDA: A New Approach to Reduced-Complexity I-vector Language Recognition

مقایسه روش های طیفی برای شناسایی زبان گفتاری

Noise Compensation for Speech Recognition Using Subspace Gaussian Mixture Models

Gaussian Mixture Model Weight Supervector Decomposition and Adaptation

Novel Gaussianized vector representation for improved natural scene categorization

عنوان ژورنال:

اشتراک گذاری